Backport multi-format tool parser to v0.12.0#2
Conversation
|
Streaming mode is not supported at the moment |
c619f38 to
677cef0
Compare
- Add xllm.py with GroupRMSNorm (per-group variance, layernorm_num_groups) - Register XllmForCausalLM in the model registry - Supports dense xllm checkpoints (Qwen3MoE-based architecture, no MoE) - Tested with 8B checkpoint on 8x H200 (TP=8) Co-Authored-By: claude-flow <ruv@ruv.net>
Testing the XllmForCausalLM supportThis PR (with the xllm commit) was verified end-to-end against a real xllm 8B checkpoint. 1. Installgit clone --branch tgat/multi-format-tool-parser-v0.12.0 https://github.com/LLM360/vllm.git
cd vllm
uv venv --python 3.12 .venv
VLLM_USE_PRECOMPILED=1 uv pip install --editable .2. Serve a checkpoint with
|
✅ Test results on xllm 8B checkpointRan the test plan from the previous comment against a real xllm 8B checkpoint ( .venv/bin/vllm serve /path/to/xllm/checkpoint \
--trust-remote-code \
--tensor-parallel-size 8 \
--port 8000 \
--reasoning-parser k2_v3 \
--enable-auto-tool-choice \
--tool-call-parser multi_formatLogs confirm architecture resolution: Parser + model test suite: 14/15 passed
Math answers being correct is the key signal that What's verified by this run
End-to-end sanity: Harbor taskAlso ran a full Harbor |
Summary
multi_formattool parser support fordefault,qwen3,glm,minimax,dsv32,gptoss, andpythonchat-template formatschat_template_kwargsinto compatible tool parsers without breaking legacy parser constructorsglmandpythonparsing with the exact chat-template output formatTesting
.venv/bin/python -m ruff check vllm/entrypoints/openai/serving_chat.py vllm/entrypoints/openai/serving_engine.py vllm/entrypoints/openai/serving_responses.py vllm/entrypoints/openai/tool_parsers/__init__.py vllm/entrypoints/openai/tool_parsers/multi_format_tool_parser.py tests/entrypoints/openai/test_tool_parser_kwargs.py tests/entrypoints/openai/tool_parsers/test_multi_format_tool_parser.py.venv/bin/python -m pytest tests/entrypoints/openai/test_tool_parser_kwargs.py tests/entrypoints/openai/tool_parsers/test_multi_format_tool_parser.py -v